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Abstract 



This paper considers the problera of transferring a file from one source node to multiple receivers in a 
peer-to-peer (P2P) network. The objective is to minimize the weighted sum download time (WSDT) for 
the one-to-many file transfer. Previous work has shown that, given an order at which the receivers finish 
downloading, the minimum WSD can be solved in polynomial time by convex optimization, and can be 
achieved by linear network coding, assuming that node uplinks are the only bottleneck in the network. 
This paper, however, considers heterogeneous peers with both uplink and downlink bandwidth constraints 
specified. The static scenario is a file-transfer scheme in which the network resource allocation remains 
^ ' static until all receivers finish downloading. This paper first shows that the static scenario may be optimized 

in polynomial time by convex optimization, and the associated optimal static WSD can be achieved by 
linear network coding. This paper then presented a lower bound to the minimum WSDT that is easily 
computed and turns out to be tight across a wide range of parameterizations of the problem. This paper 
also proposes a static routing-based scheme and a static rateless-coding-based scheme which have almost- 
optimal empirical performances. The dynamic scenario is a file-transfer scheme which can re-allocate the 
, network resource during the file transfer. This paper proposes a dynamic rateless-coding-based scheme, 

■ which provides significantly smaller WSDT than the optimal static scenario does. 
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I. Introduction 



P2P applications (e.g, [T], [2], [3], [1]) are increasingly popular and represent the majority 
of the traffic currently transmitted over the Internet. A unique feature of P2P networks 
is their flexible and distributed nature, where each peer can act as both a server and a 
client [5]. Hence, P2P networks provide a cost-effective and easily deployable framework for 
disseminating large files without relying on a centralized infrastructure (6j. These features 
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of P2P networks have made them popular for a variety of broadcasting and file-distribution 
apphcations [6] [7] [8] [9] [10] [H] [12]. 

Specifically, chunk-based and data-driven P2P broadcasting systems such as CoolStream- 
ing [7] and Chainsaw [8] have been developed, which adopt pull-based techniques [7], [8]. In 
these P2P systems, the peers possess several chunks and these chunks are shared by peers 
that are interested in the same content. An important problem in such P2P systems is 
how to transmit the chunks to the various peers and create reliable and efficient connections 
between peers. For this, various approaches have been proposed including tree-based and 
data-driven approaches (e.g. [lO] [13] [H] [I5] [IS] [H] [H]). 

Besides these practical approaches, some research has begun to analyze P2P networks from 
a theoretic perspective to quantify the achievable performance. The performance, scalability 
and robustness of P2P networks using network coding are studied in [19] [20]. In these 
investigations, each peer in a P2P network randomly chooses several peers including the 
server as its parents, and also transmits to its children a random linear combination of all 
packets the peer has received. Random linear network coding [21] [22] [23], working as a 
perfect chunk selection algorithm, makes elegant theoretical analysis possible. Some other 
research investigates the steady-state behavior of P2P networks with homogenous peers by 
using fluid models [21] [25] [26] . 

In a P2P file transfer application (e.g. Bit Torrent [1], Overcast [12j), the key performance 
metric from an end-user's point of view is the download time, i.e., the time it takes for 
an end-user to download the file. In [9], Li, Chou, and Zhang explore the problem of 
delivering the file to all receivers in minimum amount of time (equivalently, minimizing the 
maximum download time to the receivers) assuming node uplinks are the only bottleneck 
in the network. They introduce a routing-based scheme, referred to as Mutualcast, which 
minimizes the maximum download time to all receivers with or without helpers. 

This paper also focuses on file transfer applications in which peers are only interested in 
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the file at full fidelity, even if it means that the file does not become available to all peers 
at the same time. In particular, this paper considers the problem of minimizing weighted 
sum download time (WSDT) for one-to-many file transfer in a peer-to-peer (P2P) network. 
Consider a source node s that wants to broadcast a file of size S to a set of N receivers 
i e {1, 2, • • • , N} in a P2P network. Our model assumes that the source uphnk bandwidth 
constraint Us, the peer uplink bandwidth constraints Ui, and the peer downlink bandwidth 
constraints Di are the only bottlenecks in the network. Limited only by these constraints, 
every peer can connect to every other peer through routing in the overlay network. 

In order to understand the fundamental performance limit for one-to-many file transfer 
in P2P networks, it is assumed that all nodes are cooperative, and a centralized algorithm 
provides the file-transfer scenario with the full knowledge of the P2P network including the 
source node's uplink capacity , and the weights, downlink capacities, and uplink capacities 
of peers. The cooperative assumption holds in many practical applications, for example, in 
closed content distribution systems where the programs are managed by a single authority. 

II. Main Contribution 

The general problem of minimizing WSDT divides into an exhaustive set of cases according 
to three attributes. The first attribute is whether the allocation of network resources is static 
or dynamic. In the static scenario, the network resource allocation remains unchanged from 
the beginning of the file transfer until all receivers finish downloading. The dynamic scenario 
allows the network resource allocation to change as often as desired during the file transfer. 

The second attribute is whether downlink bandwidth constraints are considered to be 
unlimited (i.e. Di = oo) or not (i.e. Di < oo). Most research in P2P considers the download 
bandwidth constraints to be unlimited because the uplink capacity is often several times 
smaller than the downlink capacity for typical residential connections (e.g., DSL and Cable). 
However, consideration of downlink bandwidth constraints can be important. The downlink 
capacity can still be exceeded when a peer downloads from many other peers simultaneously, 
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as in the routing-based scheme proposed in [27] . 

The third attribute is whether we consider the special case of sum download time (i.e. 
Wi = 1 for all i) or the general case of weighted sum download time which allows any values 
of the weights Wi. 

With these cases in mind, here is an overview of the results presented in this paper. For 
the static scenario that considers download bandwidth constraints Di < oo and allows any 
values of Wi, Section UTTl uses a time-expanded graph and linear network coding to show that 
the minimum WSDT and the corresponding allocation of network resources may be found in 
polynomial time by solving a convex optimization problem. We also present a lower bound 
on minimum WSDT that is easily computed and turns out to be tight across a wide range 
of parameterizations of the problem. 

While the minimum WSDT for the static scenario may be found in polynomial time 
using the approach of Section IIIIl that approach is sufficiently computationally intensive 
that Sections IIVI and |V] provide lower complexity alternatives. In some cases, the lower 
complexity approaches are exactly optimal. For the remaining cases, the lower bound of 
Section IIIIl shows that their performance is indistinguishable from the lower bound and 
hence closely approach optimality across a wide range of parameterizations. 

Sections [IV] and |V] build on the foundation of the Mutualcast algorithm [9]. Mutualcast is 
a static rate allocation algorithm designed to minimize the maximum download time to all 
peers in the case where Di = oo. Section Unl concludes by showing that Mutualcast achieves 
that section's lower bound when Wi = 1 for all i and therefore minimizes sum download 
time as well as maximum download time. 

Inspired by this result. Section HVl proposes a generalization of this algorithm. Extended 
Mutualcast, that minimizes sum download time even when the download bandwidth con- 
straints Di are finite and distinct from each other. When uplink bandwidth resources are 
plentiful. Extended Mutualcast also minimizes weighted sum download time regardless of 
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weights because each receiver is downloading content as quickly as possible given its down- 
load bandwidth constraint and the upload bandwidth constraint of the source. 

It is notable that Mutualcast and Extended Mutualcast achieve their optimal results while 
utilizing only depth- 1 and depth-2 trees. Inspired by this fact and the technique of rate- 
less coding, Section |V] attacks the general problem of minimizing weighted sum download 
time(WSDT) by proposing a convex optimization approach that assumes only trees of depth 
one or two. Then, Section |V] proposes a simple water- filling approach using only depth- 1 
and depth-2 trees. While the optimality of this approach is not proven. Section IV-EI shows 
that its performance matches that of the lower bound of IIIII for a wide variety of param- 
eterizations. Thus this water-filling approach provides a simple algorithm that empirically 
achieves the lower bound on WSDT for all cases of the static scenario across a wide range 
of parameterizations. 

Turning our attention to the dynamic scenario, Wu et al. |27] demonstrate that given an 
order in which the receivers finish downloading, the dynamic allocation (neglecting downlink 
bandwidth constraints) that minimizes WSDT can be obtained in polynomial time by convex 
optimization and can be achieved through linear network coding. They also propose a 
routing-based scheme which has almost-optimal empirical performance and demonstrate how 
to significantly reduce the sum download time at the expense of a slight increase in the 
maximum download time. 

Dynamic schemes can reduce the minimum sum download time to approximately half that 
of the static case, at least when downlink capacities are considered to be infinite |27]. Es- 
sentially, [27] shows that to optimize WSDT the network resource allocation should remain 
constant during any "epoch" , a period of time between when one receiver finishes download- 
ing and another finishes downloading. Thus, one optimal solution for the dynamic scenario 
is "piecewise static". However, [27] leaves the proper selection of the ordering as an open 
problem and does not address the finite downlink capacities < oo or the general case of 
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weighted sum download time which allows any values of the weights Wi. 

Section IVT] provides a practical solution for the dynamic scenario. Specifically, it provides 
an approach the ordering problem left open by [27] by reformulating the problem as that of 
determining the weights that should be assigned during each static epoch so as to produce 
the piecewise static solution that minimizes the WSTD (according to the original weights). 
This approach handles both finite downlink capacities Di < oo and the general case of 
weighted sum download time which allows any values of the weights Wi. A key result of 
this section is that, regardless of how the overall weights Wi are set, the "piecewise static" 
solution may be obtained by finding the appropriate weights for each epoch and solving the 
static problem for that epoch. Furthermore, during any epoch the appropriate weights of 
all peers are either 1 or zero with the exception of at most one "transitional" peer whose 
weight can be anywhere between zero and 1. Neglecting the "transitional" node, the ordering 
problem becomes approximately one of choosing which peers should be served during each 
epoch. Having resolved the ordering problem in this way, the simple water-filling approach 
of Section |V] provides the rate allocations for the source and for each peer during each of 
the piecewise-static epochs. Thus this section provides a complete solution for the dynamic 
scenario. Because the selection of the ordering and the rate allocation are both close to 
optimal, we conjecture that the overall performance of this solution is close to optimal 
across a wide range of parameterizations. 

Section IVIIII delivers the conclusions of this paper. 

III. Convex Optimization of WSDT in the Static Case 

This section considers a static P2P network in which the source node with uplink band- 
width Us seeks to distribute a file of size B so as to minimize the weighted sum of download 
times given a static allocation of resources. The static scenario assumption also indicates 
that no peer leaves or joins during the file transfer. There are N peers who want to down- 
load the file that the source node has. Each peer has weight Wi, downlink capacity Di and 

6 



uplink capacity f/j, for i = 1,2, ■ ■ ■ , N . It is reasonable to assume that Di > Ui for each 
i = 1, - ■ ■ ,N since it holds for typical residential connections (e.g., Fiber, DSL and Cable). 
In case of < Ui for some z, we just use peer i's part of the uplink capacity which equals 
to its downlink capacity and leave the rest of the uplink capacity unused. 

The uplink and downlink capacities of each peer are usually determined at the application 
layer instead of the physical layer, because an Internet user can have several applications 
that share the physical downlink and uplink capacities. The peer weights depend on the 
applications. For broadcast applications such as CoolStreaming [7] and Overcast [12jin 
which all peers in the P2P network are interested in the same content, all peer weights in 
the content distribution system can be set to 1. In multicast applications such as "Tribler" 
[25] peers called helpers, who are not interested in any particular content, store part of 
the content and share it with other peers. Assign weight zero to helpers, and weight 1 to 
receivers. In some applications, P2P systems partition peers into several classes and assign 
different weights to peers in different classes. 

Denote the transmission rate from the source node to peer j as Vg^j and the transmission 
rate from peer i to peer j as Vi^j. The total download rate of peer j, denoted as dj, is the 
summation of Vg^j and ri^j for all i ^ ]■ Since the total download rate is constrained by 
the downlink capacity, we have 

dj = r^^j + ^ ri^j < Dj,\fj = 1, - ■ ■ , A^. (1) 

As a notational convenience, we also denote rj^j as the transmission rate from the source 
node to peer j so that 

N 

d, = J2n^j<Dj,^j = i,--- ,N. (2) 

i=l 

The total upload rate, denoted constrained by the uplink capacity. Hence, we also 

have Uj = Yli^j ^j^i ^ for all j = 1, ■ ■ ■ , A^. 
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Fig. 1 
The peer model 



One example of the peer model is shown in Fig. [TJ The downlink capacity and uplink ca- 
pacity of peer 1 are Di and Ui respectively. Thus, the total download rate rs^i + J2t=2 ^i^^ ~ 
'^f^i Ti^i has to be less than or equal to Di, and the total upload rate J2t=2 '^i^* t)e 
less than or equal to Ui. 

A. The Time-Expanded Graph 

As one of the key contributions of [27], Wu et al. used a time-expanded graph to show 
how the dynamic scenario decomposes into epochs. This section applies the time-expanded 
graph approach provided in [27j to the static case. 

To obtain the time-expanded graph for a P2P network with peers, we need to divide 
the time into epochs according to the finishing times of the peers. One peer finishes 
downloading at the end of each epoch so that the number of epochs is always equal to the 
number of peers. Let Atj denote the duration of the i-th epoch. Hence, i receivers finish 
downloading by time ti = ^^=1 At^. If peers i and i + 1 finish downloading at the same 
time, Atj+i = 0. 

Each vertex in the original graph G corresponds to A^ vertices, one for each epoch, in the 
time-expanded graph G^^^ as follows: We begin with the original P2P graph G with node set 
V = {s,l, ■ ■ ■ , N} and allowed edge set E. For each v E V and each n G {1, ■ ■ ■ , A^}, G*-^-* 
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includes a vertex f '^"■^ corresponding to the associated physical node v in the n-th epoch. For 
each e G E going from m to f and each n G {1, ■ ■ ■ , A^}, G^^'^ includes an edge e*^"-* going 
from u^"^ to v^"^ corresponding to the transmission from u to v during the n-th. epoch. 

The subgraph G^"^^ = {V^"'\ E^"''^) for n = I,-- - characterizes the network resource 
allocation in the n-th epoch. To describe a rate allocation in the original graph G, edges are 
typically labeled with the rate of information flow. However, since each epoch in the time- 
expanded graph G^^^ has a specified duration, each of the N edges in the time-expanded 
graph corresponding to an edge in G is labeled with the total amount of information flow 
across the edge during its epoch. This is the product of the flow rate labeling that edge in 
the original graph G and the duration of the epoch. 

The time-expanded graph also includes memory edges. For each v & V and each n G 
{1, ■ ■ ■ , — 1}, there is an edge with infinite capacity from f^"^ to t>("+^). These memory 
edges reflect the accumulation of received information by node v over time. 

As just described, the time-expanded graph not only describes the network topology, but 
also characterizes the network resource allocation over time until all peers finish downloading 
in a P2P network. As shown in |27] by Wu et al., even in the dynamic scenario the network 
resource allocation can remain static throughout each epoch without loss of optimality. In 
this section, we apply the time-expanded graph to the static scenario in which the rate 
allocation remains fixed for the entire file transfer. 

As an example, consider the following scenario. A P2P network contains a source node 
seeking to disseminate a file of unit size {B = 1). Its upload capacity is Us = 2. There 
are three peers {1,2,3} with upload capacities Ui = U2 = U3 = 1 and download capacities 
^1 = ^2 = ^3 = 00. 

Fig. [2] gives one possible static rate allocation, showing the allocated rate for each edge of 
the original P2P graph G. (Edges with zero allocated rate are not shown.) The source node 
transmits with a rate of 1 to peer 1 and with rate 1/2 to peers 3 and 4. Peer 1 transmits 
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Fig. 2 

An example P2P graph G. Edges are labeled with one possible rate allocation 
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The time-expanded G^^^ for the P2P graph G shown in Fig. [2J Edges are labeled with the 

TOTAL information FLOW ALONG THE EDGE DURING THE EPOCH. THIS IS THE PRODUCT OF THE RATE 
allocation ALONG THE EDGE (fROM THE GRAPH IN FiG. [2|) AND THE DURATION OF THE EPOCH. 

Edges with zero flow are not shown. 
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with rate 1 to peer 2 but does not transmit to any other peers. Peers 2 and 3 transmit with 
rate 1/2 to Peer 1, but do not transmit to any other peers. 

Fig. [3] shows the time-expanded graph induced by the static rate allocation shown in 
Fig. [21 Because there are three peers, this time-expanded graph has 3 epochs. The peers are 
numbered in the order they finish downloading; peer 1 finishes first followed by peer 2 and 
then peer 3. The first epoch lasts Ati = 1/2 time units, the second epoch lasts At2 = 1/6 
time units, and the third epoch lasts At^ = 4/3 time units. 

Peer 1 finishes first because it sees the full upload capacity of the source. As shown in 
Fig. [2] it sees rate 1 directly from the source. The other half of the source upload capacity is 
relayed to peer 1 by peers 2 and 3 immediately after they receive it. Hence peer 1 receives 
information with an overall rate of ri = 2 and finishes downloading the entire file, which has 
size B = 1 at time ti = 1/2. As a result, the duration of the first epoch is Ati = 1/2. 

Peer 2 sees rate 1/2 directly from the source and rate 1 relayed to peer 2 by peer 1. Hence 
it sees an overall upload capacity of r2 = 3/2 and finishes downloading the entire file at time 
^2 = 2/3. The duration of the second epoch can be computed as ^2 — = 1/6. 

Because it receives no help from the other two peers, peer 3 sees an overall upload rate of 
only r3 = 1/2, which it receives directly from the source. It finishes downloading the entire 
file at time ^3 = 2. The duration of the third epoch can be computed as — t2 = 4/3. 

The sum of the download times for the example of Figs. [2] and [3] is 1/2 + 2/3 + 2 = 3 1/6 . 
Now let's consider an example that minimizes the sum of the download times and in which 
peers finish at the same time. 

Fig. m shows the rate allocation that achieves the minimum possible sum of download 
times for a static allocation in this scenario, which turns out to be 1 4/5. The allocation 
shown in Fig. [His perfectly symmetric. Each peer receives rate 2/3 directly from the source 
and rate 1/2 from each of the two other peers. Each peer receives an overall rate of 5/3. 
Hence all three peers finish downloading simultaneously at t = 3/5 and the second and third 
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Fig. 4 

An example P2P graph G. Edges are labeled with the rate allocation r^^j that minimizes 

THE SUM OF THE DOWNLOAD TIMES. 
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The TIME-EXPANDED G^^^ FOR THE P2P GRAPH G SHOWN IN FiG. [H EDGES ARE LABELED WITH THE 
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ALLOCATION ALONG THE EDGE (fROM THE GRAPH IN FiG. |1])AND THE DURATION OF THE EPOCH. 

Edges with zero flow are not shown. 
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epochs have zero duration. 

B. Transmission Flow Vectors and a Basic Network Coding Result 

In Section IIII-AI there was a tacit assumption that all of the information received by a 
peer is useful. For example, we assumed that the information relayed from peer 2 to peer 1 
did not repeat information sent from the source to peer 1. In the examples of Section flll-At 
one can quickly construct simple protocols that ensure that no critical flows are redundant. 
In this subsection, we review a general result that uses network coding theory to show that 
there is always a way to ensure that no critical flows are redundant. 

Consider a general graph G = (V, E), which could be either a rate-allocation graph G such 
as Figs. [2] or m or a time-expanded graph such as G^^^ described in Figs. [3] and [51 Denote 
c(e) as the capacity of the edge e G -E. A transmission flow from the source node s to a 
destination node i is a nonnegative vector / of length \E\ satisfying the flow conservation 
constraint: excesSy{f) = 0,Wv E V\{s,i}, where 

excess.(/)= /(^) " E /(^)- (3) 

e(iln{v) e(iOut{v) 

The total flow supported by / is Ylie<^Out{s) /(e)- This "flow" could be a flow rate with units 
of bits per unit time if we are considering a rate allocation graph such as Fig. [2] or it could 
be a total flow with units of bits or packets or flies if we are considering a time-expanded 
graph such as Fig. [31 

As an example, the flow vector / describing the flow in Fig. [31 from S**-^^ (the source in 
the flrst epoch) to destination node 2*^^^ (peer 2 in the second epoch, when peer 2 finishes 
downloading) has the nonzero elements /(e) shown in Table [H Examining Table [I verifies 
that the flow conservation constraint ([3]) is satisfled and that the total flow supported is 
equal to 1 flle. 
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TABLE I 

Table showing nonzero elements /(e) for the flow vector / from S'^^^ to 2^^^ in Fig. EJ 



e 


/(e) 




1/4 


sw ^ 1(1) 


1/2 


1(1) ^ 2(1) 


1/2 


2(1) ^ 2(2) 


3/4 


5(1) _> 5(2) 


1/4 


5(2) ^ 2(2) 


1/12 


5(2) ^ 1(2) 


1/6 


1(2) ^ 2(2) 


1/6 



The following lemma states that a given fixed fiow (or fiow rate) can be achieved from 
the source to all destinations as long as there is a feasible fiow vector supporting the desired 
fiow from the source to each destination, i.e. We can achieve this fiow to all destinations 
with network coding without worrying about possible interactions of the various fiows.. 

Lemma 1: (Network Coding for Multicasting |21| p2j ) In a directed graph G = 
(V, E) with edge capacity specified by a vector c , a multicast session from the source node 
s to a set of receivers i G {1, . . . , A^} can achieve the same fiow r for each i G {1, . . . , A^} if 
and only if there exits a set of fiows {/j} such that 



c > max/i (4) 

where is a fiow from s to z with fiow r. Furthermore, if (jl]) holds, there exists a linear 
network coding solution. 

C. A Convex Optimization 

Given an order in which the peers will finish downloading, say peer i finishes at the end 
of the fcj-th epoch, applying Lemma [U to the time-expanded graph G'(^) with the set of 
destination nodes i G {1, . . . , A^} gives a characterization of all feasible downloading times, 
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as concluded in the following lemma. 

Lemma 2: (Feasible Downloading Times with Given Order [27j ) Consider a P2P 
network in which node Dj = oo. Given an order in which the nodes will finish downloading 
a file with size B, say node i finishes at epoch ki, a set of epoch durations Ati is feasible if 
and only if the following system of linear inequalities has a feasible solution: 

At, >0, i = l,---,N, (5) 
g>f„ z = l,---,iV, (6) 
XI 9uii-)^v(i) < Cu^U, Vm G V, i = 1, ■ ■ ■ , iV, (7) 

where Cu is the uplink capacity of peer u, and is a fiow from first-epoch source node s*-^-* 
to node i's termination-epoch node i^''^^ with fiow rate B. 

As an example, the epoch durations of Fig. [3] are feasible because each of the fiow vectors 
(one example was given in Table [T]) satisfy the feasibility constraints of Lemma [2J 

Let tj denote the download time to peer j for j = I,-- - ,N. Given a static network 
resource allocation ri^j, G {1, ■ ■ ■ , A^}) as shown for example in Fig. [2l the maximum 
fiow to peer j, denoted equal to the minimum cut between source node s and peer 

j in the rate-allocation graph (i.e. a graph such as Fig. [2], not the time-expanded graph). 
This follows from the Max-Flow-Min-Cut Theorem. Hence, tj > ^, Vj. 

From applying network coding results such as Lemma [T]to the rate allocation graph alone, 
we cannot conclude much about feasible download times since Lemma [1] addresses only the 
feasibility of the same fiow to all destinations. However, by applying Lemma [1] to the time- 
expanded graph we can show that tj = ^ can be achieved simultaneously for all j = 1, ■ ■ ■ , A^. 
Lemma E] below states this result. 

Lemma 3: Given a static network resource allocation Vi^j, = 1, ■ ■ ■ , A^), for a P2P 
network, the only Pareto optimal (smallest) download time vector is = ^ for j = 1, ■ ■ ■ , A^, 
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where vj is the minimum cut from the source node s to peer j. 



Proof: It has been shown that tj > ^ for j = 1, - ■■ ,A^. Hence, it is sufficient to 
show that tj = ^ for j = 1, ■ ■ ■ , is achievable. Without loss of generality, assume that 
> r2 > ■ ■ ■ > rj\i > 0. Construct a static scheme (i.e. a time-expanded graph G^^'^) as 
follows: 

(1) AU = f-- where tq = oo; 

(2) Flow capacity of edge i^^^ — )■ j'^^^ is ri^jAtk ior 1 < i ^ j < N and A; = 1, ■ ■ ■ , A^; 

(3) Flow capacity of edge s^'^'^ — t- j^''^ is r^^j At^ for j, k = 1, ■ ■ ■ , N; 

(4) Flow capacity of edge j'-'^^ — )■ is infinity for j = 1, ■ ■ ■ , and k = 1, - ■ ■ , N — 1; 

(5) The destination nodes in the time-expanded graph are node i*-*-* for i = 1, ■ ■ ■ , N. In 
other words, peer i finishes at the end of i-th epoch. 

According to the constructions (1) and (5), the download time to peer z is = Ylk=i = f-- 
According to the constructions (2) and (3), in the subgraph G^''\ the maximum fiow from 
s^'^^ to i^''^ is equal to ViAtk ioi all i, k = 1, ■ ■ ■ , N . Therefore, in this time-expanded graph 
G^'^\ the maximum fiow from source node s to node i^*^ is greater than or equal to 

i 

J2 ri^tk = B. 

k=l 

Therefore, by Lemma [1] and Lemma [21 there exists a linear network coding solution to 
multicast a file with size B from the source node s to peer i within download time ti = ^ 
for alH = I,--- ,Ar. ■ 

The maximum fiow can be found by solving a linear optimization. Specifically, a set of 
fiow rates {ri}f^i is feasible if and only if there exists a solution to the following system of 
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linear inequalities: 



r,^,->0, J = !,■■■ ,N- 



N 



^^i^i < Us] (recall that r^^j = r^^i) (9) 
1=1 

N 

r^~^j<U^, V2 = l,---,iV; (10) 

N 

J2r^^^<D„ Vi = l,---,iV; (11) 



0</Si<r.^i,Vz,j,fc = l,---,iV; (12) 

/S, = o, VjVfc; (13) 

N N 

E/i-= E V.^A:; (14) 

5]/E>^fc, VA; = l,---,iV, (15) 



j=i 



where Vi^j = I,-- - ,N) represents the network resource allocation and fj;'^j = 
1, ■ ■ ■ , A^) is a flow from the source node s to peer k. 

By Lemma[31 the minimum WSDT is the solution to the convex optimization of minimizing 
T.i=lW^B/r^ subject to Thus, we can conclude the following theorem: 

Theorem 1: Consider multicasting a file with size B from a source node s to peers {1, ■ ■ ■ , A^} 
in a P2P network with both uplink and downlink capacity limits. The minimum weighted 
sum downloading time for the static scenario and the corresponding optimal static allo- 
cation can be found in polynomial time by solving the convex optimization of minimizing 
Yl,k=i ^kB/vi subject to the constraints flSl fT^ . 
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Theorem [T] gives a solution to the most general static case that we are considering in this 
paper. However, it can be extended further by adding other linear network constraints (e.g. 
edge capacity constraints), which are not a concern of this paper. 



D. The Uplink-Bandwidth-Sum Bound 

For a P2P network with a source node and N peers, the convex optimization in Theorem 
[T]has + A^^ + variables and 2A^'^ + 3A^^ + + 1 linear constraints. The complexity for 
the interior point method to solve this convex optimization is 0{{N^f-^) [29]. 

Even though the convex optimization can be solved in polynomial time, its complexity is 
still too high for practical applications when is large. Hence, bounds on the minimum 
WSDT and static schemes having network resource allocations that may be computed with 
low complexity are desired. In this subsection, we provide an analytical lower bound to 
the minimum WSDT with 0{N'^) complexity for computing both the bound itself and the 
associated rate allocations. 

Consider the cut of {V \i} ^ {i} for any static allocation rj^j i,j G {1, ■ ■ ■ , A^}, the 
maximum flow rate from the source node s to peer i, r^, is limited by 



N 




(16) 



and 



N 



N 



N 




(17) 



i=l 



N 



N 



N 




(18) 



N 




(19) 
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Consider the cut of {s} — )• {1, ■ ■ ■ 



A^}, Tj is also bounded by 



N 



(20) 



Inequalities ( fT6|) and ( l20|) indicate that the downloading flow rate for peer i is limited by 
peer z's downlink capacity and the source node's uplink capacity respectively. These two 
constraints are not only valid for the static scenario but also for dynamic scenarios. 

Inequality ( fT9|) shows that the sum of the downloading flow rates for all peers is bounded 
by the total amount of the network uplink resource. Again, this constraint holds in both the 
static and dynamic cases. 

These three constraints characterize an outer bound to the region of all feasible sets of 
{ri}fLi satisfying flHl- [T5l) . Therefore, for any static scheme, every set of feasible flow rates 
{ri}fLi must satisfy ([IS]), ([I9]) and dlO]). However, not all {rj^^^ satisfying (1T6|). (fT9l) and 
f l20|) are feasible. 

Consider the following example: Let B = 1, Us = 3, and Ui = U2 = U3 = 1 (with 
Di = D2 = = 00), the downloading flow rates ri = r2 = 3, rs = satisfies the constraints 
( IT6|) . ( fT9|) and ( 120|) . but are not feasible because there is no solution to (IH]-[l5]) with ri = r2 = 
3,r3 = 0, i.e., no static scenario to support ri = r2 = 3,r3 = simultaneously. Specifically, 
for ri + r2 = 6, all upload capability must be deployed, including that of peer 3. However, 
since = 0, any transmission by peer 3 would violate the conservation-of-flow constraint. 

Because all feasible sets of {ri}fLi satisfy f|T6|) f flQl) and fl20|) . the solution to the following 
minimization problem provides a lower bound to the minimum WSDT for the static scenario: 



mm 



subject to 



0<ri<Di^ min(A, Us),\/t = 1, ■ ■ ■ , iV, 



(21) 
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where only (z = 1, ■ ■ ■ , A^) are the variables. Empirical experiments presented in Section 
IV-EI show that this lower bound is tight for most P2P networks. 

The minimization problem ( 12T|) is a convex optimization. Its optimal solutions are also 
the solutions to the associated Karush—Kuhn— Tucker (KKT) conditions [29j. The KKT 
conditions for problem (l2Ti) are 

+ A + = 0, t = l,---,N; (22) 

N N 

J]r,-t/,-5^?7, <0, A>0; (23) 

i=l 1=1 

ri-D^< 0, in > 0; (24) 

N N 

X{J2r^-U,-Y,U^) = 0■ (25) 

i=l 1=1 

/i,(r,-A) = 0, 2 = l,---,iV. (26) 
Solving the KKT conditions yields the following optimal solution for {rj}: 



Di if ^/W, ■R> D, 



where R is chosen such that 



N N N 



i=l i=l i=l 



The lower bound to the WSDT for the static scenario is then 



AT ^ 



i=l 1=1 « 



(27) 



^ r* = min(^, + E E ^^)- (^8) 



J2^^t.>J2^.-, (29) 

with r* as specified in ( 1271) . 



20 



For the special case where Wi = 1 and Di = oo {i = 1, - ■ ■ , N), the solution given in ([2 
becomes 

r; = min(f/„^^^±^^), (30) 
and the lower bound to the minimum WSDT is 

J2U> ■ (31) 

tr min([/„ ^-+^-^^' ) 

Mutualcast [D] was designed to minimize the maximum download time for the case where 
Di = oo. However, since Mutualcast can achieve the download time of rr'^, ^iv r, for 

ram{Us, ^ ) 

all peers, it achieves the lower bound of ( l29l) for the Wi = 1 case. This fact shows both that 
the lower bound of ( l29i) is tight when Wi = 1 and Di = oo and that Mutualcast minimizes 
sum download time as well as the maximum download time when Di = oo. 

IV. Mutualcast and Extended Mutualcast for the Equal- Weight Static 

Case 

The concluding paragraph of Section IIII-DI stated that Mutualcast minimizes the sum 
download time for the case where Di = oo. In this section we extend Mutualcast to provide 
an algorithm we call Extended Mutualcast that handles finite constraints on Di (possibly 
delivering different rates to different peers) while still minimizing the sum download time. 

A. Mutualcast 

Mutualcast delivers the same rate to every peer. Assuming Di = oo, Mutualcast can 
support peers with any rate R < mm{Us, " ' ). The key aspect of Mutualcast is that 

the source first delivers bandwidth to each node according to how much that node can share 
with all other peers. After that, if the source has any upload bandwidth left over, it is divided 
evenly among all peers. This leftover rate goes serves only one peer; it is not relayed to any 
other peers. Thus Mutualcast first forms a series of depth-two trees from the source to all 
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nodes. Then, if there is any source upload bandwidth left over, it is used to form a series 
of depth-one trees. Here is a specification of the Mutualcast algorithm (without considering 
helper nodes): 



Algorithm 1 The Mutualcast Algorithm for Network Resource Allocation 

1: Given broadcast rate R < mm{Us, " ' )• 

2: Given an ordering of the peers. (Without loss of generality, assume the order is 1, ■ ■ ■ ,N.) 

3: for i = 1 to A do 

4: r,^i^mm{R,Ui/{N -1)). 
5: ri_,j ^ Ts^i for j ^ i. 

6: R R — Ts^i- 

7: Us ^Us - Ts^i. 

8: end for 

9: Ts^i ^ Vs^i + R. 



Mutualcast delivers information to all peers at the same rate. As described in Algorithm 
[T]the highest rate that Mutualcast can deliver is 

R = mm{Us, ^^ + y=^^' ). (32) 

Consider two examples with ten peers, one in which R = Ug and one in which R = " 

First is an example where R = Ug. Note that in general it is not possible for any peer 
to receive information at a rate higher than Ug- Let Ug = 1, Ui = 1 for all ten peers, and 
Di = oo for all ten peers. Mutualcast achieves R = Ug = 1 hj having nine peers receive rate 
1/9 from the source and forward at that rate to the nine other peers. One peer receives no 
information directly from the source because by the time the Mutualcast algorithm gets to 
that peer, the source upload bandwith has been used up. 

For an example where R = " ' , a larger Ug is necessary. Let t/g = 10, ?7j = 1 for 

all ten peers and Di = oo for all ten peers. Mutualcast achieves R = " ^^^^ ' =2. In 
the first part of the Mutualcast algorithm, all ten peers receive rate 1/9 from the source and 
relay at that rate to the nine other peers. At this point there remains 80/9 of source upload 

22 



bandwidth, which is distributed evenly so that each peer receives a rate of 8/9 directly from 
the source that it does not relay. In total, each peer receives rate 2 which is comprised of 
rate 1 from other peers, rate 1/9 from the source that it relays to the other peers, and rate 
8/9 from the source that it does not relay. 

The basic Mutualcast algorithm does not consider download constraints. The slight mod- 
ification of Mutualcast given below includes download bandwidth constraints Di in the sim- 
plest possible way. Note that if all peers are to receive at the same rate, that rate must be 
less than the smallest download constraint. This is reflected in line 1 of Algorithm [21 

Algorithm 2 The Mutualcast Algorithm with Download Bandwidth Constraints 

1: Given broadcast rate R < min ( Ug, " ' , minjg{i_...^Ar}(Dj) j . 

2: Given an order of peers. (Without loss of generality, assume the order is 1, ■ ■ ■ , A^.) 
3: for i = 1 to A^ do 

4: rs^i^mm{R,Di,Ui/{N -1)). 

5: Ti^j <- Ts^i for j ^ i. 

6: R ^ R — Tg^i. 

7: Us ^ Us - Ts^i. 

8: ^ Dj -Ts^ifoT J = !,■■■ ,N. 

9: end for 

10: Ts^i ^ Ts^i + R. 



As with the original Mutualcast, Algorithm [2] delivers the same rate to every peer. This 
alone is enough to prevent it from minimizes the sum download time in general when there are 
download constraints. However, it will turn out to be an important component of Extended 
Mutualcast, which is an algorithm that does minimize the sum download time under general 
download constraints. 

B. Extended Mutualcast 

Setting Wi = 1 for all i in (1271) produces the following lower bound on the sum download 
time when both upload and download constraints are considered: 
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where 



" B 



R, iiR< Di 

r: = { : ^ (34) 

D, if R> Di 

mm{R,Di,Us), (35) 



where R is chosen such that 

N N N 

J2 r* = mm{Us + t^., A). (36) 

i=l i=l i=l 

This lower bound can be achieved by a routing-based scheme that we call Extended Mutu- 
alcast. 

Consider a P2P network with constraints on peer uplink bandwidth and peer downlink 
bandwidth. Without loss of generality, assume that Di < • • • < Djy. Hence, -Di < ■ ■ ■ < 
Dn ^ Us and r]" < ■ ■ ■ < r^. The network resource allocation and the routing for Extended 
Mutualcast are provided in Algorithms [3] and H] respectively. 
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Algorithm 3 Network Resource Allocation for Extended Mutualcast 

1: Calculate R and r* {i = !,■■■ , N) from ^MB- 

2: Initialize network resource allocation rj^j ^ 0. 

3: if R< Di then 

4: r* = i?for alH = I,--- ,A. 

5: Apply Algorithm [2] with rate R to the network. 

6: else if Dj < R< Dj+i for j G {1, ■ ■ ■ , - 1} then 

7: r* = A for i G {1,- ■ ■ , j}- 
8: r* = for z G {j + 1, ■ ■ ■ , A}. 
9: for Step z = 1 to j do 

10: Successively apply Algorithm |2] with rate Di — Di^i [Dq = 0) to the network with 
the source node s and the ordered peers {«,■■■ , A}. Note that with each successive 
application of Algorithm [21 the values of rj_^j accumulate. 

11: end for 

12: Step j + 1: Apply Algorithm [2] one final time with rate R — Dj to the network with 
the source node s and the ordered peers {j + 1, ■ ■ ■ ,N}. Again, the values of ri^j 
accumulate. 

13: else if R> Dn then 

14: r* = A = A for z = 1, ■ ■ ■ , A. 

15: for Step i = 1 to A do 

16: Successively apply Algorithmic with supporting rate Di — -Dj_i to the network with 
the source node s and the ordered peers {«,■■■ , A}. Note that with each successive 
application of Algorithm [2l the values of Vi^j accumulate. 

17: end for 

18: end if 
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Algorithm 4 Routing Scheme for Extended Mutualcast 
1: Given R and r* (i = 1, • • • , A^) from ([HIM]). 

2: Given the network resource allocation rj^j = 1, ■ ■ ■ , A^) by Algorithm E] where 
Ti^i = Ts-^i > r-i^j. (This routing scheme is based on that network resource allocation.) 
3: Partition the whole file into many chunks. 
4: if i? < Di then 

5: Apply the routing scheme of Mutualcast [9J. That is, for each i = 1, ■ ■ ■ , N and some 
j ^ i, accumulatively route fraction of all chunks from the source node to peer i, 
and then copy and route them from peer i to other peers. Accumulatively route the 
rest of the chunks are from the source node to all peers directly. 

6: else if Dj < R < Dj+i for j = 1, ■ ■ ■ , - 1. then 

7: For k = 1, ■ ■ ■ ,j, broadcast fraction of all chunks to peers {fc, ■ ■ ■ , A} by 

Mutualcast. Broadcast the rest of the chunks to peers {j + 1, ■ ■ ■ , N} by Mutualcast. 
8: Until peers {j ' + 1, ■ ■ ■ , A} finish downloading. 
9: for Step i = j to 1 do 

10: In Step z, the interesting chunks are those peer i hasn't received. 

11: For k = I,-- - ,i, accumulatively broadcast fraction of the interesting 

chunks to peers k, - ■ ■ ,i hj Mutualcast. 
12: Until peer i finishes downloading. 

13: Note that peers i + 1, - ■ ■ , A finish downloading before Step i. 

14: Note that prior to Step i, none of the peers 1, ■ ■ ■ ,i contain the interesting chunks 

broadcast during Step i. 
15: end for 
16: else if R> Dn then 
17: for Step i = A to 1 do 

18: In Step the interesting chunks are those peer i hasn't received. 

19: For k = I,-- - ,i, accumulatively broadcast fraction of the interesting 

chunks to peers k, - ■ ■ ,i by Mutualcast. 
20: Until peer i finishes downloading. 
21: end for 
22: end if 
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The network resource allocation for Extended Mutualcast (Algorithm [3]) is obtained by 
successively applying Algorithm [2] to the P2P network or part of the P2P network. The 
network resource allocation by Algorithm [3] has r^^j > Vi^j for all The flow rate to 
peer i, ri, is then equal to its download rate J2f=i''^j~>i- The routing scheme for Extended 
Mutualcast (Algorithm H]) guarantees that the entire flow rate is useful. For the Extended 
Mutualcast rate allocation, = min(i?, Di) so that the lower bound (13311361) on sum download 
time is achieved. Theorem [2] formally states and proves this fact. 

Theorem 2: (Minimum Sum Download time) Consider multicasting a file with size B 
from a source node s to peers {1, ■ ■ ■ , A^} in a P2P network with constraints on peer uplink 
bandwidth and peer downlink bandwidth. The minimum sum download time for the static 
scenario is J2iLi where r*, the flow rate to peer i, follows from f l33ll36l) . 

i 

Proof: (Converse) From (I33tl36p . Ylf=i is a lower bound on the minimum sum 
download time. Hence, any sum download time less than YliLi ^ is not achievable. 
(Achievability) It is sufficient to show that (a) Extended Mutualcast is applicable to any 
P2P network, and (b) Extended Mutualcast provides a static scenario in which the flow rate 
from the source node to peer i is r* of f p3H36|) . 

(To Show (a)) It is sufficient to show that in Algorithm |3l the rate for each applied Algo- 
rithms [2] is attainable. In other words, each rate for the applied network is less than or equal 
to the minimum of the source node's uplink capacity and the total uplink resource over all 
of the peers. 



. UR< Di, then R < Us and R < ^- , Hence, the rate R is attainable for Algorithm 

[2] in Line 4, Algorithm |3l 

• If Dj < R < -Dj+i, consider the worst case of Di = R for i = j + 1, ■ ■ ■ ,N and Us = R. In 

27 



this case, we have 



r; = A, t = lr--,N; (37) 
U,<D, = D,, z = l,...,j; (38) 
Di<---<bj<R = Dj+i = ■■■ = = Us; (39) 

N N 

Y,D, = Us + Y,U^. (40) 

i=l i=l 

Denote Up'' as the total amount of the peers' uphnk resource used after Step z, and uf* as the 
total amount of the source node's uplink resource used after Step i. For Step 1, Di < Us and 
Di < — - = " j^^^ ' ■ Hence, Algorithm [2] in Step 1 is feasible. Suppose Algorithm [2] is 
feasible for Step 1 to Step n{l<n< j). Then t/F = A and f/^'^ = J^^^^ Dk + {N -i-l)Di. 
Hence, 

i i i 

k=l k=l k=l 

which indicates that Algorithm [2] for Step 1 to Step i fully deploys the uplink resources of 
peers 1, • • • , i. 

Now consider Algorithm |2] for Step n + 1, the supporting rate is Dn+i — Dn- The source 
node's uplink is i? — Dn. The total uplink resource is 

N 



f/s + 5^f/.-(f/i"^ + f/i"^) (42) 

i=l 

N n 

= Y,D^-{J2Dk + {N~ n)Dn) (43) 

1=1 k=l 
N 

= 5^ (A - A) (44) 

k=n+l 

>{N-n){Dn+i-Dn), (45) 
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where ( H3l) follows from ( HOj) . and ( H5|) follows from ( l39l) . Hence, the rate Z)„+i — Z)„ is less 
than or equal to the total available uplink resource ( H2l) divided by the number of peers, 
N — n. We also can see that -D^+i — Dn is less than or equal to the available source node's 
uplink bandwidth, R — Dn- Therefore, Algorithm |2] for Step n + 1 is also feasible. By 
induction. Algorithm [2] is feasible for every step. 
. If i? > Dn, then 

Di < ■ ■ ■ < Dn < R<Us] (46) 
r* = A = A, t = l,---,N; (47) 

N N N 

i=l i=l 1=1 

Consider the worst case of X^ili — X^ili + ^^s- For this worst case. Algorithm [2] in Line 
14 is feasible following an argument similar to that for the case of Dj < R < -Dj+i. 

Therefore, Extended Mutualcast in Algorithm |3] is applicable to any P2P network. 
(To Show (b)) From Algorithms [2] and |3l Extended Mutualcast constructs a static scenario 
with r^^i = Ti^i > ri_,j for i,j = l,---,N, and Y.f=i^j^i > min(i?, A) = r*. Hence, the 
maximum flow from the source node to peer i is larger than or equal to 

JV 

mm{rs^j,rj_,i) + rs-,i (49) 

N 

= 5Z '^j-^i + '^'s^i (50) 

j=ijyi 

>r*. (51) 

Therefore, Extended Mutualcast provides a static scenario in which the flow rate from the 
source node to peer i is r* of fl33tl36p . ■ 
Theorem [2] showed that Extended Mutualcast minimizes the sum download time for any 
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static P2P network. When the total uphnk bandwidth resource is sufficiently abundant, 
Extended Mutualcast also minimizes the weighted sum download time for any set of weights 
because all peers are downloading at their limit of Di. Corollary [T] formally states and proves 
this fact. 

Corollary 1: Consider multicasting a file with size B from a source node s to peers 
{1, ■ ■ ■ , A^} in a P2P network with constraints on peer uplink bandwidth and peer downlink 
bandwidth. If Us + Yl!i=i — J2iLi -^^ set of the flow rates ri = Di {i = 1, ■ ■ ■ , N) 
is attainable. Hence, the minimum weighted sum download time for the static scenario is 
J2iLi ^iTT given weights Wi. 

Proof: ( Achievability) Note that when f/, + ^Jl^ Ui > ^Jl^ Di, r* of (^M) is equal 
to Di. By Theorem [21 Extended Mutualcast can achieve the download rates r* = Di. 
(Converse) By Max-Flow Min-Cut Theorem, the maximum flow from source node to peer 
i is limited by Di = min{Di,Us). Hence, any weighted sum downloading time less than 
J2iLi ^iTT is '^'^^ achievable. ■ 

V. A Depth-2 Approach for the Minimizing Weighted Sum Download Time 

Section HVl pr ovided a complete solution (Extended Mutualcast) for achieving the minimum 
sum download time with constraints on both peer uplink bandwidth and peer downlink 
bandwidth. That section concluded by showing that if the total uplink resource is sufficiently 
abundant. Extended Mutualcast minimizes WSDT for any set of weights. This section 
attacks the minimization of WSDT more broadly. 

Mutualcast and Extended Mutualcast construct only two types of trees to distribute con- 
tent. The ffist type is a depth-1 tree as shown in Fig. [6]^a). The source node s broadcasts 
content to all peers directly with rate r^^^, i = 1, ■ ■ ■ , A^. The second type is a depth-2 tree 
as shown in Fig. [6]^b). The source node distributes content to peer i with rate r^^^, and then 
peer i relays this content to all other peers. 

In Mutualcast, the rates rf\^- are constrained to be equal for all i. Also, for a fixed z, 
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f^s-^i = for all i G {I,-- - ,iV}; i 7^ i- These constraints on the network resource 
allocation simplify the mechanism design and allow a simple routing-based scheme. These 
two constraints together ensure that each peer downloads content at the same rate. However, 
to optimize WSDT peers surely need to download content and different rates. 

In Section HVl we saw that peers needed to download content at different rates to minimize 
the sum download time with peer downlink bandwidth constraints. The Extended Mutu- 
alcast algorithm provided a way to serve the peers at different rates corresponding to their 
download bandwidth constraints so as to minimize the sum download time. However, Ex- 
tended Mutualcast required successive applications of Mutualcast which led to a complicated 
routing protocol. 

In order to serve peers at different rates to minimize WSDT and still maintain a simple 
mechanism design, we apply the technique of rateless coding at the source node. A rateless 
code is an erasure correcting code. It is rateless in the sense that the number of encoded 
packets that can be generated from the source message is potentially limitless [30]. Suppose 
the original file size is B packets, once the receiver has received any B' packets, where B' is 
just slightly greater than B, the whole file can be recovered. 

Fountain codes [30], LT codes [31], and raptor codes [32] are rateless erasure codes. LT 
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codes have linear encoding complexity and sub-linear decoding complexity. Raptor codes 
have linear encoding and decoding complexities. The percentage of the overhead packets 
goes to zero as B goes to infinity. In practice, the overhead is about 5% for LT codes with 
file size B ~ 10000 [HD]- This sub-section focuses on applying rateless erasure codes for P2P 
file transfer instead of designing rateless erasure codes. Hence, we assume the overhead of 
the applied rateless erasure code is zero for simplicity. We note that if redundancy does not 
need to be limitless, there are solutions that provide zero overhead [33] . 

A. The Rateless-Coding-Based Scheme 

We propose a rateless-coding-based scheme that constructs the two types of trees in Fig. [H] 
to distribute the content as did Mutualcast and Extended Mutualcast. The source node first 
partitions the whole file into B chunks and applies a rateless erasure code to these B chunks 
producing a potentially limitless number of chunks. 

For the depth-1 tree, the source node broadcasts different rateless-coded chunks directly 
to each peer. For the depth-2 trees. The source node sends different rateless-coded chunks to 
each peer, and then that peer relays some or all of those chunks to other some or all of the 
other peers. A key point is that every chunk transmitted by the source is different from every 
other chunk transmitted by the source. This condition guarantees that all chunks received by 
a peer are useful (because they are not a repetition of a previously received chunk). Hence, 
a peer can decode the whole file as long as it receives B coded chunks. 

The rateless-coding-based scheme allows peers to download content at different rates with 
a simpler mechanism than the routing-based approach of Extended Mutualcast. Peers don't 
have to receive exactly the same chunks to decode the whole file. Hence, the two types of tree 
structures can be combined as one tree structure with depth 2, but without the constraint 
that the rate from the peer to its neighbors has to equal the rate from the source to the peer. 

The source node sends coded chunks to peer i with rate Ts^i = r^^^ + r^^^, and peer i 
relays some of them to peer j {j ^ i) with rate r^^j < Ts-^i- Note that the values of ri^j do 
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not even need to be the same for a fixed value of i and different values of j. 

Another benefit of applying a rateless coding approach is that it is robust to packet loss 
in the Internet if we allow some extra rate for each user. 

Assuming rateless coding at the source node and constraining the P2P network to include 
only depth-2 trees as discussed above, the network resource allocation that minimizes WSDT 
can be obtained by solving the following convex optimization problem. 

subject to < Ti^j < ri^i,\/i,j = 1, ■ ■ ■ , A^, 

Elin^^<Us, (52) 

Ej^ij^i ri^j < t/i, Vi = 1, ■ ■ ■ , AT, 
r- = r < D- \fi = 1 ■ ■ ■ N 

where rj_^j = Vg^i. The complexity for the interior point method to solve this convex 
optimization is 0{{N'^f-^) |29] . 

For the case of Wi = l,Di = oo, the optimal resource allocation is, of course, the same as 
that of Mutualcast. 

For the case of Wi = 1 and finite values of Di, Algorithm |3] provides an optimal network 
resource allocation that certainly also solves (132]) . A key point is that the routing of Algo- 
rithm H] becomes unnecessary if the source employs rateless coding. Peers need only relay the 
appropriate number of chunks to the appropriate neighbors without worrying about which 
chunks are relayed. 

For other cases, we provide a network resource allocation that we have not proven to be 
optimal. We will see in Section IV-EI that its performance achieves the lower bound (l29l) 
across a wide range of parameterizations. 
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B. Resource Allocation for Networks with Di = oo 



Consider a P2P network in which peer uphnk bandwidth is constrained but Di = oo for 
i = l,--- ,N. IiJ2tiUi> {N-l)Us, then the resource allocation of ri^j = J^n^Kj achieves 
the minimum WSDT with flow rates rj = Ug for alH = 1, ■ ■ ■ , A^. (This is the case discussed 
at the end of Section HVl) Otherwise, consider the following water- filling-type solution: 



Wi-R, ii^,-R<Us, , ^ 

(53) 



where R is chosen such that 



N" N 

^fi = V,^^U,~ max(ffc) . (54) 

1=1 i=l 



The potential suboptimality of this approach comes from the subtraction of maxfc(ffc) on 
the right side of (15^ which does not appear in ( I28ll . Note that when max(ffc) <^ ^s + Xlili ^« 
(this is true for large iV), fj is close to r* corresponding to the lower bound ( 1271) . 

We now show that the proposed suboptimal network resource allocation ensures that the 
flow rate to peer i is larger than or equal to fj of ( l53l) . Hence, the WSDT for the proposed 
suboptimal resource allocation is very close to the lower bound to the minimum WSDT for 
large networks. 

First assign the rates for the depth-2 trees with 



^(2) _ ^ [/jmax(rfc) 

Z^fc=l "''k fi 



2) --^jiiiaAV'fc; 



and 



Z^fc=i ^fc ~ 
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where c is chosen to be the largest possible value satisfying 



TV 

J2''^X<Us (57) 

i=l 
N 

J2 ^^^i<U,. (58) 



Plugging ([55]) ([56D into ([57]) ([58]), and obtain 



c = min(l, ^1 , ), (59) 

max(rjfcjQ; 



where a = Ylf-i — ^• 



If c = — then the depth-2 trees have already fully deployed the source node's 
uplink. The rate assignment for depth-2 trees is the network resource allocation for the 
rateless-coding-based scheme. 



If c = 1, then the depth-2 trees have fully deployed all peers' uplinks, but not the source 



node's uplink. Hence, we can further deploy the rest of the source node's uplink to construct 



the depth-1 tree. After constructing the depth-2 trees, the flow rate to peer i is 



A (2) , ~ , (maxfc(rfc) - ri)Ui 



The rest of the source node's uplink is 



N 

Us-^ rfXi = Us-a max(ffc). 

i=l 
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The optimal depth-1 tree can be obtained by the convex optimization 



mm 
subject to 



ri = + r^^i, 



> 0, Vi = 1 ■ ■ ■ , AT, 



Ell '"Hi <Us- amax(ffc). 



(60) 



The optimal solution to the problem (1^ is 



Pi 



if y/Wl ■R<(3i 



(61) 



and 



,(1) 



ii^/Wi-R<Pi, 

^Af (1) TT /- \ v^A^ TT , v^Af 



(62) 



where R is chosen such that X]i=i "^Wi = Us — amax(rfc) (also Ej=i fi = Us + X]i=i Ui)- 



The complexity of calculating this suboptimal network resource allocation is 0{N'^). Note 
that when Wi = 1 for all i = 1, ■ ■ ■ , A^, this suboptimal network resource allocation is the 
same as that of Mutualcast, and hence, this network resource allocation is optimal for this 
case. For general weight settings, this network resource allocation guarantees that the flow 
rate to peer i is larger than or equal to fj, which is stated in the following theorem. 



Theorem 3: For P2P networks with peer uplink constraints but no peer downlink con- 
straints (i.e. Di = oo), the network resource allocation determined by ( l55l) ( l56l) ( l59l) ( 1611) 
and (l62l) ensures that the WSDT Eili ^iB/ri is less than or equal to the WSDT associated 
to (PD, i.e., Etl^^B/ri. 
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Proof: If c = — X , the flow rate to peer i is 



N 

(2) 



maxfc(ffc) - ri) Ui 
~ _ ~ 



can + c — — : : — (64) 



> car, (65) 
~ (66) 



max(rfc) 

> r„ (67) 
where I E7}i follows from < -D/t < t/^- If c = 1, a feasible solution to problem (p7]) is 

rl^i = {Us - amax(ffe)) 



For this feasible solution, the total flow rate to peer i with the depth- 1 tree and the depth-2 
trees is 



ri = Pi + [Us - a max(ffc)) (68) 



(max(rfc) - ri)? /^ 

fc=l '"'s 2^k=l ~ 



(a + ([/^ - a max(rfc)) ^ )rj H : — • (69) 
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Denote 7 = a + {Ug — a max(rfc)) ^y-^ . . We have 



N N 



Us + J2u, = J2r, (70) 

i=l i=l 

i=l i=l l^k=l 

N N ,^ . 

+ E - (72) 

N 

< 7 X] + max(ffc) (73) 

i=l 

N 

= '^{Us + y^Ui- max(ffc)) + max(ffc). (74) 
1=1 

Some of these steps are justified as follows: 

rO|l follows from the fact that all uplink resource is deployed; 
2]) follows from the inequality < ^ when a > 6 > > 0; 
Wj follows from Y1=i rk = Us - maxfj. + Xl^i f^i > Zlili Ui > Ui. 
Therefore, 7 > 1. Hence, 

r.=,n+ ^''l-''''!"' >f.. (75) 



which indicates that this feasible solution to the problem ( ]60l) provides a WSDT less than 
or equal to J2iLi^iB/ri. Hence, the network resource allocation determined by ( l55i) ( l56l) 
dSni) (EU) and (El also provides a WSDT less than or equal to J^^Li WiB/vi. ■ 



C. Resource Allocation with Peer Downlink Constraints 

Now we consider P2P networks with both peer uplink bandwidth constraints and peer 
downlink bandwidth constraints. The idea of the resource allocation for these P2P networks 
is the same as that for P2P networks without downlink constraints. The details are provided 

38 



as follows: 



If Us + Ui > J2i=i from Corollary [H Algorithm |3] provides the optimal network 
resource allocation. 



If Us + J2iLi < J2iLi -Dj, consider a water- filling- type solution 



(76) 



where R is chosen such that Vi = Us + J^iLi Ui — max(rfc 



First construct the depth-2 trees with rates in (155|) and ( 156|) . where c is still chosen to be 
the largest possible value. However, for general P2P networks, the constraints on c are not 
only (I57D ([SH]), but also 

TV 

A = rS,+ ^.-^<A. (77) 

After constructing the depth-2 trees, the flow rate to peer i is The used source node's 
uplink is camax(ffc). If camax(ffe) < Us, we can further use the rest of the source node's 
uplink to distribute content through the depth-1 tree. The optimal resource allocation for 
the depth-1 tree can be obtained by the convex optimization 

niin Ef=iW^4 
subject to Tj = /3j + r^^^, 

r;^,>0,V^ = l---,Ar, (78) 
ri< A,V2 = 1-- - ,iV, 
Eii <Us- camax(ffc). 
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The optimal solution to the problem ( ITHj) is 



r, = < (3i if 



(79) 



if y/m ■R> D, 



and 



r-i - A 



(80) 



where R is chosen such that 



AT 



N 



i=l 



i=l 



The complexity of calculating this resource allocation is 0{N'^ 



D. Routing-Based Depth-2 Scheme 

So far, this section has provided a family of rateless-coding-based schemes for P2P file- 
transfer applications. In this subsection, we introduce a routing-based scheme. This routing- 
based scheme is a further extension to Extended Mutualcast. This scheme also applies the 
tree structures in Fig. |6] to distribute content. The constraints on the network resource 
allocation for this scheme are 

r^X >■■■> ^^Ik^ > 0' (81) 

and 

»,(2) ^ ^ ^ ^ ^ 

> rk,^k,+^ >■■■> rk,^kN,yi = 1, ■ ■ ■ ,N, (82) 
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where (fci, ■ ■ ■ , k^) is the order in which the peers finish downloading. In the rest of this sub- 
section, we assume the order is (1, ■ ■ ■ , A^) for simphcity. These constraints are stricter than 
those of the rateless-coding-based scheme, and they are introduced to simphfy the routing 
scheme. In particular, given the order of (1, ■ ■ ■ , A^) in which peers finish downloading, the 
proposed routing-based scheme ensures that at any time in the scheme, peer i has all packets 
received by peers z + 1, ■ ■ ■ ,N for all z = 1, ■ ■ • , — 1. This condition can be achievable if 
the network resource allocation satisfies ( IHTj) and (l82i) . For the routing-based scheme, when 
peer i finishes downloading, the scheme starts to only broadcast the chunks which peer i + 1 
hasn't received, called interesting chunks. With this condition, the interesting chunks are 
also new to peers i + 2, ■ ■ ■ ,N. The details of the routing-based scheme is given in Algorithm 

El 

Algorithm 5 Routing-Based Scheme 

1: Given the order in which the peers finish downloading. Assume the order is (1, ■ • ■ , A^) 
for simplicity. 

2: Given the network resource allocation {vi-^j, r^^^, r^^-} for i,j = l,---,N, which satis- 
fies the constraints (IHTj) and (l82l) . 
3: Partition the whole file into many chunks. 
4: for Step z = 1 to A^ do 

5: At the beginning of Step i, peer 1, ■ ■ ■ ,i finish downloading. 

6: In Step i, only broadcast the chunks which peer i doesn't have, called interesting 
chunks. Note that all peers i, - ■ ■ ,N don't contain the interesting chunks. 

7: Distribute interesting chunks along the depth- 1 tree and the depth-2 trees according 
to the network resource allocation. 

8: For the depth- 1 tree , the set of chunks sent to peer i contains the set of chunks sent 
to peer j for i < j. 

9: For the depth-2 tree in Fig. [6](b), the set of chunks from peer i to peer k contains the 
set of chunks from peer i to peer j for k > j. Peer i only keeps the set of chunks sent 
to peer i — 1 for z = 2, ■ ■ ■ , A^. 

10: The above two chunk selection constraints guarantee that peers i, - ■ ■ ,N don't contain 
the interesting chunks in Step i for z = 1, ■ ■ ■ , A^. 

11: Until peer i receives all interesting chunks and finishes downloading. 

12: end for 



The optimal network resource allocation for this routing-based scheme can be obtained 
by the convex optimization of minimizing Xli^i "^iB/ri subject to the constraints (1811) ( 182|) . 
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nodes' uplink and downlink constraints, and the flow rate expression 

N 

(2) 

where ri^o = '^^^^ complexity for the interior point method to solve the problem is 

0((A^^)'^'^). For the case of PFj = 1 and Di = oo, the optimal network resource allocation 
is the same as that of Mutualcast. For the case of VFj = 1 or Us + J2iLi Ui > Xlili by 
Theorem |2] and Corollary [H Algorithm [3] provides the optimal network resource allocation. 

For general cases with Us + J2iLi Ui < Yl!i=i provide a suboptimal network resource 

allocation for this routing-based scheme. Consider the water-filling- type solution in fl76|) . 
Without loss of generality, assume that ri > ■ ■ ■ > tn, and give the ordering (1, ■ ■ ■ , A^) in 
which the peers finish downloading. First construct the depth-2 trees with rates in (155|1 and 
(|56|) . where c is still chosen to be the largest possible value satisfying f l57|l fl58|) and fl77|) . 
After constructing the depth-2 trees, the effective flow rate to peer i is 

N 

A = rj^i + ri^i-i (83) 

= c{afi + -^f^^^U), (84) 

where fo = fi- The download rate (used downlink) for peer i is c{afi + ^n^'P _ Ui). Note 
that the effective flow rate is smaller than the download rate for peer i. This is because 
peer i only keeps a subset of chunks received from the source node. For this reason, parts of 
peer i's downlink and the source node's uplink are wasted. The total amount of the wasted 
uplink is 

TV _ _ 
i=l l^k=l "^k 

The used source node's uplink is cafi. If cafi < Ug, we can further use the rest of the 
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source node's uplink to distribute content through the depth-1 tree. The constraints on the 
resource allocation for the depth-1 tree are (ISTll . 



rE,< A:-A,Vz = l,---,iV, (86) 



and 

N 



Y,ril,<U,-car,. (87) 

i=l 

Let Wi = m.mk<i(Wk) . Let Di = mmk<i{Dk — (3k)- A sub-optimal network resource allocation 
for the depth-1 tree is 

Vw^ -R-Pi, if A < Vw^ ■R<Di, 

(1) 



r 



if VWi ■R<f3„ 

bi-p,, ifv^-i?>A, 



and Ti = r^^j -|- /5j, where R is chosen such that 



AT 



r^^li = Us- can 



1=1 



and also 

N N 



J2ri = Us + cJ2u^-U^ 



i=l i=l 

The complexity of calculating the suboptimal resource allocation for the routing-based 
scheme is Oi^N"^). 

E. Simulations for the Static Scenario 

This subsection provides the empirical WSDT performances of the rateless-coding-based 
scheme, the routing-based scheme, and compares them with the lower bound to the WSDT. 
In all simulations, the file size B is normalized to be 1. This subsection shows simulations 
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for 6 cases of network settings as follows: 

• Case I: Ui = 1, Di = CO for i = 1, N; 

. Case II: f/i = 1, A = 8 for z = 1, ■ ■ ■ , A^; 

. Case III: Ui = i/N, Di = oo ioi i = 1, ■ ■ ■ , N; 

. Case IV: U^ = i/N, Di = 8i/N hi i = 1, ■ ■ ■ , N; 

. Case V:Ui = l + 9S{i > N/2), D^ = oo ior i = I, ■ ■ ■ , N; 

. Case VI: = 1 + 96{i > N/2), A = 8i/N, i = l,--- ,N; 

where 6{-) is the indicate function. 

Consider small P2P networks with = 10 peers. The performances of sum download 
time versus Us for these 6 cases are shown in Fig. [71 The performances of WSDT versus 
Us with weight Wi = i/N {i = 1, ■ ■ ■ , N) are shown in Fig. |H1 The performances of WSDT 
versus Us with weight Wi = 1 + 6{i > N/2) {i = 1, ■ ■ ■ , N) are shown in Fig. |9l In all these 
simulations, the weighted sum download times of the rateless-coding-based scheme and the 
routing-based scheme achieve or almost achieve the lower bound. 

Consider large P2P networks with = 1000 peers. The performances of sum download 
time versus Us for these 6 cases are shown in Fig. [101 The performances of WSDT versus 
Us with weight Wi = i/N {i = 1, - ■ ■ , N) are shown in Fig. [Til The performances of WSDT 
versus Us with weight Wi = 1 + 6{i > N/2) (i = 1, ■ ■ ■ , A^) are shown in Fig. [121 In all these 
simulations, the weighted sum download times of the rateless-coding-based scheme and the 
routing-based scheme also achieve or almost achieve the lower bound. 

We also simulated for many other network settings and weight settings. In all these simu- 
lations, the rateless-coding-based scheme achieves or almost achieves the lower bound to the 
WSDT. Hence, the lower bound to the WSDT is empirically tight, and the rateless-coding- 
based scheme has almost-optimal empirical performance. The routing-based scheme also 
has near-optimal empirical performance. However, for few cases there are clear differences 
between the performance of the routing-based scheme and the lower bound. 
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Fig. 7 

Sum download time versus Us for small P2P networks with = 10 peers. 



VI. The Dynamic Scenario 

The dynamic scenario is allowed to re-allocate the network resource during the file transfer, 
in particular, whenever a peer finishes downloading, joins into the network, or leaves from 
the network. 

A. A Piece-wise Static Approach to the General Dynamic Case 

Wu et al. [27] show that to optimize WSDT the network resource allocation should be 
dynamic, but may remain constant during any "epoch" , a period of time between when one 
receiver finishes downloading and another finishes downloading. Thus, one optimal solution 
for the dynamic scenario is "piecewise static" . 

As an example of how a "piecewise static" dynamic allocation can reduce the WSDT, 
consider the example for which we studied static allocations in Section IIII-AI Recall that 
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Fig. 8 

Weighted sum downloading time versus Us for small P2P networks with = 10 peers and 

WEIGHT Wi — i/N . 



the example was for a P2P network with Us = 2, B = 1 and three peers {1,2,3} with 
Ui = U2 = Us = I and Di = D2 = = 00. Fig. [13] shows the time-expanded graph 
corresponding to the optimal dynamic rate allocation for this example. Because there are 
three peers, this time-expanded graph describes a file transfer scenario with 3 epochs. The 
first epoch lasts 0.5 unit time. In the first epoch, the source node sends half of the file to 
peer 1 and the other half to peer 2. Peer 1 and peer 2 exchange their received content, and 
hence, both peer 1 and peer 2 finish downloading at the same time. Hence, the second epoch 
lasts time units (since ^2 — ^ — 1 = 0). The third epoch lasts 0.25 unit time, in which the 
source node, peer 1 and peer 2 transmits to peer 3 simultaneously. Peer 1 sends a quarter 
of the file. Peer 2 sends another quarter. The source node sends the other two quarters. 

This dynamic solution turns out to achieve the minimum possible sum download time for 
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Weighted sum downloading time versus Us for small P2P networks with A'' 

WEIGHT Wi^l + 99S{i > N/2). 



10 PEERS AND 



this example which is 1.75. For comparison, the optimal static solution, which we saw in 
Section FlII-AI had an only slightly larger sum download time of 1.8. This simple example 
shows that a dynamic rate allocation can reduce WSDT. In certain cases the benefit can be 
significant. Dynamic schemes can reduce the minimum sum download time to approximately 
half that of the static case, at least when downlink capacities are considered to be infinite 

m- 

B. A Rateless- coding Approach to Dynamic Allocation 

Wu et al. [27] propose a dynamic routing-based scheme. This scheme first deploys all 
uplink resource to fully support the first K peers until they finish downloading, where K is 
appropriately chosen. After that, the scheme deploys all uplink resource to fully support the 
next peer until it finishes downloading, an so forth. Inspired by the work ^27j, we propose a 

47 



800 



Case I 



Q 
E 

Z] 



600 



400 



-e — Lower Bound 
-B — Rateless Code 
Routing 



200 400 600 800 
Source Node Uplink Us 
(a) 
Case III 



1000 



2000g<^ 



1500 



-e — Lower Bound 
-B — Rateless Code 
Routing 



1000 



500 



1000© 



200 400 600 800 1000 
Source Node Uplinl^ Us 
(c) 
Case V 



Q 

E 



500 



-h3 — Lower Bound 
-B — Rateless Code 
Routing 



200 400 600 800 
Source Node Uplink Us 
(e) 



1000 



Case II 



100CX&.-t — 



800 



600 



400 



-e — Lower Bound 
-B — Rateless Code 
Routing 



200 400 600 800 1000 
Source Node Uplink Us 
(b) 
Case IV 



2500, 



2000 



1500 



-e — Lower Bound 
-B — Rateless Code 
Routing 



1000 



1000© 



200 400 600 800 1000 
Source Node Uplink Us 
(d) 
Case VI 



Q 

E 
u 
U3 



500 



-e — Lower Bound 
-B — Rateless Code 
Routing 



200 400 600 800 1000 
Source Node Uplink Us 
(f) 



Fig. 10 

Sum download time versus Us for large P2P networks with N = 1000 peers. 



dynamic rateless-coding-based scheme for P2P networks with both peer uphnk bandwidth 
constraints and peer downhnk bandwidth constraints. This scheme is apphcable for dynamic 
P2P networks in which peers may even join or leave the network. 

The key idea of this dynamic rateless-coding-based scheme is similar to that of the dynamic 
routing-based scheme in [27]. In particular, in each epoch, the scheme deploys all uplink 
resource to fully support several chosen peers. The details of the dynamic rateless-coding- 
based scheme are provided in Algorithm |6l 

Algorithm [6] provides the structure of the dynamic rateless-coding-based scheme. Because 
the peers always receive independently generated rateless coded chunks in the static rateless- 
code scheme, the dynamic rateless-coding-based scheme is also applicable for dynamic P2P 
network. As long as a peer receives enough rateless coded chunks ^, it can decode the whole 

^The number of coded chunks needed to decode the whole file is only slightly larger than the total number of the 
original chunks. 
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Weighted sum downloading time versus Us for large P2P networks with N 
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1000 PEERS 



Algorithm 6 Dynamic Rateless-Coding-Based Sclieme 
1: Initiate the P2P network. Peers join into the network. 

2: while A peer finishes downloading, joins into the network or leaves from the network 
do 

3: Select a set of peers and reset peers' weights. (The peer selection algorithm and the 

weight setting are addressed in Section IVI-Cp 
4: Apply the static rateless-coding-based scheme based on the new weights until a peer 

finishes downloading, joins into the network or leaves from the network. 
5: end while 



file. The key issue is how to set the peers' weights in each epoch. Since the weight setting 
and the static rateless-coding-based scheme in the current epoch will infiuence the dynamic 
scheme in the following epoches, the problem of setting weights is very complicated. We will 
address this problem in Section IVl-Cl and show that this problem is approximately equivalent 
to selecting a set of peers to fully support. 
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Weighted sum downloading time versus Us for large P2P networks with N — 1000 peers 

AND WEIGHT Wi ^ 1 + 99S{i > N/2). 



C. A Solution to the Ordering Problem 

Wu et al. [27] demonstrate that given an order in which the receivers finish downloading, 
the dynamic allocation (neglecting downlink bandwidth constraints) that minimizes WSDT 
can be obtained in polynomial time by convex optimization and can be achieved through 
linear network coding. However, [27] leaves the proper selection of the ordering as an open 
problem and does not address the finite downlink capacities Di < oo or the general case of 
weighted sum download time which allows any values of the weights Wi. 

The simulations for the static scenario in Section IV-EI show that the WSDT of static 
rateless-coding-based schemes are very close to that of the lower bound ( 12711291) . Hence, the 
fiow rates in f l271) are achievable or almost achievable by the static rateless-coding-based 
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A TIME-EXPANDED GRAPH FOR A P2P NETWORK WITH THREE PEERS WHERE C/5 = 2, -8 = 1, 

Ui = U2 = U'i = 1 AND Di = D2 = = 00. Edges are labeled with the total information flow 

ALONG THE EDGE DURING THE EPOCH. THIS IS THE PRODUCT OF THE RATE ALLOCATION ALONG THE 
EDGE DURING THE EPOCH AND THE DURATION OF THE EPOCH. 



scheme. Recall that the constraints on the rate rj in (|27|) are 

0<ri<A, Vz = l,---,Ar, 

and 

N N 
i=l i=l 

In the following discussion, we assume that any set of flow rates rj (i = 1, ■ ■ ■ , A^) satisfying 
the above constraints is achievable by the static rateless-coding-based scheme. 

Consider one epoch of the dynamic rateless-coding-based scheme. Suppose there are N 
peers in the network in the current epoch. Peer i {i = 1, ■ ■ ■ , N) has uplink capacity 
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Ui, downlink capacity and B — q^B received rateless-coded chunks. Suppose the static 
rateless-coding-based scheme supports peer i with flow rate ri {i = 1, - ■ ■ ,N) based on a 
weight setting. In order to find the optimal weight setting for the current epoch, we study 
the necessary conditions for the flow rates {i — 1, N) to he optimal. 

Let us first focus on two peers in the network, say peer 1 and peer 2. The total amount 
of the uplink resource supporting peer 1 and peer 2 is s = ri + r2. If the flow rates for 
i — 1, - ■ • ,N is optimal, then the flow rates ri and r2 are also the optimal resource allocation 
for peers 1 and 2 given that the flow rates for i = 3, ■ ■ ■ , N are fixed. Now consider a 
suboptimal scenario in which the uplink resource with the amount of s serves peers 1 and 2, 
and the rest of the uphnk serves other peers in all of the following epoches. This suboptimal 
scenario provides a WSDT close to the minimum WSDT if S'^Us + Eti Ui (this is true for 
large A^). Hence, we consider this suboptimal scenario and address the necessary conditions 
for ri and r2 to be the optimal resource allocation for peers 1 and 2. 

If ^ < then peer 1 finishes downloading before peer 2 does. After peer 1 finishes 
downloading, peer 1 acts as a source node and hence the total amount of the source nodes' 
uplink is Ug + Ui, and peer 2 is supported by the uplink resource with the amount of s. 
Hence, the WSDT for peers 1 and 2 is 

A. = H..«ig + ^.(M+ .f;f:„J , (89) 

ri ri mm{s,D2,Us + Ui) 

and 

^ = -w, + -^^^V^)- (90) 

ri rj mm(s,L>2,C/s + C/i) 

Note that the sign of ^ does not depend on ri. Hence, the optimal solution to ri is either 

ri — r2qi/q2 (peer 1 and peer 2 finish at the same time) if —Wi — W2 + niin(s ihu +Ui) — 0) 

or ri = min(s, ^1) (peer 1 is fully supported) if -Wi -W2+ ,^in{s,D^^Us+Ui) 0. Simflarly, 

if ^ then peer 2 finishes downloading before peer 1 does. The WSDT for peers 1 
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and 2 is 

q2B 



and 



^ = + — -). (92) 

r2 r| min(s,L)i,C/s + L'2) 

Note that the sign of ^ does not depend on r2 eithter. Hence, the optimal solution to r2 is 
either r2 = Tiq2/q\ (peer 1 and peer 2 finish at the same time) if -W2 -Wi + > 
0, or r2 = min(s, D2) (peer 2 is fully supported) if —W2 — Wi-\- ^^^^.^ ^jj^-^ < 0. Therefore, 
the optimal resource allocation for peer 1 and peer 2 is achieved when one of the peers is 
fully supported, or they finish at the same time. 

Lemma 4-' Given that the flow rates to peer i for i = 3, - ■ ■ ,N are flxed, and the amount 
of uplink resource supporting peer 1 and peer 2 is s. If the optimal resource allocation for 
peer 1 and peer 2 is achieved when they finish at the same time, then both peer 1 and peer 
2 are fully supported. 

Proof: Let — min(s, Di, Ug + U2) and D'^ — min(s, D2, Us + Ui). According to the 
above discussion, the optimal resource allocation for peer 1 and peer 2 is achieved when they 
finish at the same time if and only if s > + D2, or > and ^ > 0. 
li s > Di -\- D2-I then ri — Di, r2 — D2, and hence, peers 1 and 2 are fully supported. 
If ^ > and ^ > 0, then 

-Wi - M/2 + --^ > 0, 

and 

-W2 - W^i + > 

. Hence, < l^i < ^7s+^ W2 and < W2 < ^~£l^ Vt^i- Multiply the above two inequalities 
and obtain 

s>bt + bt>bi + b2. 
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Therefor, peer 1 and peer 2 are also fully supported. 



Corollary 2: Given that the flow rates to peer i for i = 3, ■ ■ ■ ,N are fixed, and the amount 
of uplink resource supporting peer 1 and peer 2 is s. The optimal resource allocation for 
peer 1 and peer 2 is achieved when one of them is fully supported or both of them are fully 
supported. 

Corollary 3: The optimal network resource allocation in each epoch of a dynamic scenario 
is only obtained when some peers are fully supported, at most one peer is partially supported, 
and the other peers are not supported. 

Proof: (proof by contradiction) If two peers are partially supported, say peer 1 
and peer 2 are partially supported, then the resource allocation for peer 1 and peer 2 is not 
optimal by Corollary [2l ■ 

By Corollary |3l the optimal weight setting in each epoch is W = 1 for the fully supported 
peers, < < 1 for the partially supported peer, and W = for other peers. Hence, the 
problem of optimizing the weight setting is approximately equivalent to selecting a set of 
peers to fully support. 

Now study the necessary conditions for a peer selection to be optimal in a similar way. 
Suppose that the amount of uplink resource supporting peer 1 and peer 2 is s, and the flow 
rates to peer i for i = 3, ■ ■ ■ ,N are fixed. 

If s < 2i±22£)-^ and s < ^^^^D2, then peer 1 finishes downloading if peer 1 is fully supported, 
or peer 2 finishes downloading if peer 2 is fully supported. When peer 1 is fully supported, 
the WSDT for peer 1 and peer 2 is Ai in ( l89l) with ri = Di. When peer 2 is fully supported, 
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the WSDT for these two peers is A2 in ( l9T|) with r2 = D2- Hence, we have 

qiB ,qiB - 



Di ' Di mm{s,D2,U, + Ui)' 

D2 D2 mm{s,Di,Us + U2) 

_,,w.(J,-i-)(i-4-, 

+ (q,BW^ - q^BWr)(l- + 4- - -=i-) (94) 

L>i L>2 ^^1^^2 



.W2 Wi.qiq2B{Di + D2-s) 



q2 Qi D1D2 



(95) 



Therefore, it is better to first fully support peer 1 if ^ < ^ when s < ^^^-^Di and 

92 ^ 

If > and ^1+*?^ < s < ^^^^D2^ then peer 2 always finishes downloading before peer 
1 does. In this case, it is better to first fully support peer 1 if ^ > 0, i.e.. 



W2 s- Di 
or approximately 



W2 s-Di 



If < and '^^^^^ D2 < s < ^^^^Di, then peer 1 always finishes downloading before peer 
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2 does. In this case, it is better to first fully support peer 1 if — < 0, i.e., 



or approximately 



W2 ^ s-Dt' 



Wi ^ D2 



W2 s-Do 



These discussions are concluded in the following theorem. 

Theorem 4: Given that the amount of uplink resource supporting peer i and peer j is s, 
and the fiow rates to peer k for k ^ i,j are fixed. The optimal resource allocation for peer i 
and peer j is to fully support peer i (i.e., = Di) if 

^ when^>^. ^ ' 

Corollary 4: Consider a peer selection for a dynamic scenario which selects peer i to fully 
support and peer j to not support. This peer selection is optimal only if 

r max(^, when -2^ < 

111 ^ J Wj' D, ' A -Di' ^97) 

^ when |^ > 1^. 

Proof: When peer i is fully supported and peer j is not supported, s = ri + rj = D^. 
Plugging s = Di into ( 196|) and obtain ( !97|l . ■ 
Define the binary relation >- on {1, ■ ■ ■ , A^} as z >- j if ( |97jl is satisfied. Denote a peer 
selection as (/, J) where J is the set of fully supported peers and J is the set of unsupported 
peers. (J, J) is optimal only if i y j for any i G / and j G J. For general P2P networks, 
finding the optimal (/, J) is computational impossible because the binary relation >- is not 
transitive, which means 

i y j;j y k =^ i y k. 
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an approximation to the binary relation z j is equivalent to i y j when 



Define the binary relation on {1, ■ ■ ■ , A^} as i j if ^ > The binary relation ^ is 



^ > (98) 

It can be seen by plugging (!98|) into (!97|) . The approximated binary relation has the 
transitive property, and hence, the peers can be ordered with respect to fc. Based on this 
ordering, a suboptimal peer selection algorithm and the corresponding weight setting is 
constructed as shown in Algorithm [71 

Algorithm 7 Peer Selection and Weight Setting 



1: Suppose peers are downloading in the current epoch. 

2 

3 



Let B—QiB {0 < Qi < l)be the number of chunks that peer i has received for i = 1, ■ ■ ■ , N . 
ies 

t ^ 

Select peers {ki}fi-^ to fully support. 
Set Wj = 1 if j e {ki}fi^, or Wj = otherwise. 



4: Find the smallest M such that X^^i-Ofc^ > Ug+Y^^^Pi 
5 

6 



VII. Simulations of the Dynamic Scenario 

The dynamic rateless-coding-based scheme is feasible to both static P2P networks and 
dynamic P2P networks. Consider a type of dynamic P2P networks which any peer leaves 
from as it finishes downloading, and no peer joins into. This section provides the empirical 
WSDT performances of the dynamic rateless-coding-based scheme for static P2P networks 
and dynamic P2P networks with peer leaving, and compares them with those of the the 
static scenario for static P2P networks. In all simulations, the file size B is normalized to 
be 1. This section shows simulations for Cases I, II, IV, and VI investigated in IV-EI 

Consider median-size P2P networks with A^ = 100 peers. The performances of sum down- 
load time versus Us for the 4 cases are shown in Fig. [TH Fig. [T5] shows the relative value of 
the sum download time by normalizing the lower bound to be 1 in order to explicitly compare 
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Fig. 14 

Sum download time versus Us for large P2P networks with N — 100 peers. 



the performances of the dynamic rateless-coding-based scheme and the static scenario. For 
Case I where peers have infinite downhnk capacities, the sum download time of the dynamic 
rateless-coding-based scheme is almost half of the minimum sum download time for the static 
scenario for a broad range of the source node uplink Vg. This result matches the results in the 
previous work ^27j, which says that the minimum sum download time of dynamic scenarios 
is almost half of the minimum sum download time of static scenarios when node uplinks are 
the only bottleneck in the network. Our results also show that the sum download time of 
the dynamic rateless-coding-based scheme with peer leaving decreases to almost half of the 
minimum sum download time for the static scenario as Vg increases. For Cases II, IV, and 
VI, the WSDs of the dynamic scheme and the dynamic scheme with peer leaving are also 
always smaller than the minimum WSDT for the static scenario. In particular, the WSDT 
of the dynamic scheme can be as small as 0.59, 0.70, and 0.73 of the minimum WSDT for 

58 



Case I 



Case I 



1.2 
1.1 

Q 

E 0.9 ; 

I 0.8 
■(5 

■5 0.7 
oc 

0.6 
0.5 




- Lower Bound 

- Rateless Code 
Routing 
Dynamic 

Dynamic with Peer Leaving 



■A.. A 

MW4*44WW ■ "ty. . ..-V. . -Vt. . ^ . . , 



100 200 300 400 
Source Node Upiini< Us 
(a) 
Case iV 



500 



1.05 
' 0.95 



0.9^^^ 



Q 
E 
cn 

g 0.85 
(5 

"S 0.8 

DC 

0.75 
0.7 W 



A" 



100 200 300 400 
Source Node Upiini< Us 
(c) 



500 



1.2 
1.1 

Q 

E 0.9 ■ 

3 

g 0.8 



0) 0.7 
tr 

0.6 
0.5 



O 

A' 



100 200 300 400 
Source Node Upiink Us 
(b) 
Case VI 



500 



1.05 
' 0.95 i 



Q 
E 



0.9 



w 

$ 0.85 

"S 0.8 
cr 

0.75 
0.7 



.A- 



..A- 



.A- 



.A"A"A"A-.,<i 



100 200 300 400 
Source Node Upiinl^ Us 
(d) 



500 



Fig. 15 

Relative sum download time versus Us for large P2P networks with N = 100 peers. 



the static scenario for Cases II, IV and VI, respectively. The WSDT of the dynamic scheme 
with peer leaving can be as small as 0.71, 0.82, and 0.86 of the minimum WSDT for static 
scenarios for Cases II, IV and VI, respectively. These largest improvements in percentage of 
deploying the dynamic scheme is obtained when the source node can directly support tens 
of the peers. 

The performances of WSDT versus Us with weight Wi = i/N {i = 1, ■ ■ ■ , N) are shown in 
Fig. [161 Fig. [T71 shows the relative value of the WSDT. For Case I, the sum download times 
of the dynamic rateless-coding-based scheme and the dynamic scheme with peer leaving can 
be even less than half of the minimum sum download time for the static scenario for a broad 
range of the source node uplink Us- This is because the peers with largest weight finish 
downloading first in the dynamic scheme. The WSDT of the dynamic scheme can be as 
small as 0.48, 0.49, and 0.58 of the minimum WSDT for the static scenario for Cases II, 

59 




Case I 

e — Lower Bound 
-B — Rateless Code 
Routing 
Dynamic 
A Dynamic with Peer Leaving 




100 200 300 400 500 
Source Node Upiini< Us 
(a) 
Case IV 



Lower Bound 
Rateiess Code 
Routing 
Dynamic 

Dynamic with Peer Leaving 



100 200 300 400 
Source Node Upiink Us 
(c) 



500 




60 

CD 

Q 40 

E 

^ 30 

T3 

I) 20 

CD 

*10 




Case II 

Lower Bound 
Rateless Code 
Routing 
Dynamic 

Dynamic with Peer Leaving 



■■^■■A...X.A..£..S..A..^S 

100 200 300 400 500 
Source Node Uplinl< Us 
(b) 
Case VI 



-e- 



- Lower Bound 

- Rateless Code 
Routing 
Dynamic 

Dynamic with Peer Leaving 



100 200 300 400 500 
Source Node Uplinl< Us 
(d) 



Fig. 16 
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100 peers and 



IV and VI, respectively. The WSDT of the dynamic scheme with peer leaving can be as 
small as 0.56, 0.62, and 0.77 of the minimum WSDT for the static scenario for Cases II, IV 
and VI, respectively. Note that for Case VI, the WSDT of the dynamic scheme with peer 
leaving is larger than that of the static scenario for small Ug. This is because the peers with 
larger uplink resource also have larger weight, and they finish downloading and leave from 
the network first. 

The performances of WSDT versus Us with weight Wi = 1 + 996{i > N/2) (i = 1, ■ ■ ■ , A^) 
are shown in Fig. [181 Fig. [19] shows the relative value of the WSDT. For Case I, the sum 
download times of the dynamic rateless-coding-based scheme and the dynamic scheme with 
peer leaving is around half of the minimum sum download time for the static scenario for 
a broad range of the source node uplink Ug. The WSDT of the dynamic scheme can be as 
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small as 0.58, 0.55, and 0.52 of the minimum WSDT for static scenarios for Cases II, IV 
and VI, respectively. The WSDT of the dynamic scheme with peer leaving can be as small 
as 0.64, 0.64, and 0.63 of the minimum WSDT for the static scenario for Cases II, IV and 
VI, respectively. Note that for this weight setting, the WSDT of the dynamic scheme with 
peer leaving is always smaller than that of the static scenario for Case VI. This is because 
the gain by finishing peers with larger weight is more than than the loss by the peers with 
larger uplink resource leaving from the network. 

Consider large P2P networks with N = 1000 peers. The performances of sum download 
time versus Us for the 4 cases are shown in Fig. [TH Fig. [15] shows the relative value of the 
sum download time. For Case I, the sum download time of the dynamic rateless-coding- 
based scheme is around 0.55 of the minimum sum download time for the static scenario for a 
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100 PEERS AND 



broad range of the source node uplink Ug- The sum download time of the dynamic rateless- 
coding-based scheme with peer leaving decreases to 0.70 of the minimum sum download time 
for the static scenario as Ug increases to 1000. The WSDT of the dynamic scheme can be as 
small as 0.57, 0.70, and 0.70 of the minimum WSDT for the static scenario for Cases II, IV 
and VI, respectively. 

VIII. Conclusion 

This paper considers the problem of transferring a file from one source node to multiple 
receivers in a P2P network with both peer uplink bandwidth constraints and peer down- 
link bandwidth constraints. This paper shows that the static scenario can be optimized in 
polynomial time by convex optimization, and the associated optimal static WSDT can be 
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achieved by linear network coding. Furthermore, this paper proposes static routing-based 
and rateless-coding-based schemes that closely approach a new lower bound on performance 
derived in this paper. 

This paper also proposes a dynamic rateless-coding-based scheme, which provides signif- 
icantly smaller WSDT than the optimal static scheme does. A key contribution for the 
dynamic scenario is a practical solution to the ordering problem left open by Wu. Our solu- 
tion is to recast this problem as the problem of identifying the peer weights for each epoch 
of the "piecewise static" rate allocation. 

The deployment of rateless codes simplifies the mechanism of the file-transfer scenario, en- 
hances the robustness to packet loss in the network, and increases the performance (without 
considering packet overhead). However, there still exist several issues for rateless-coding- 
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based scheme such as high source node encoding complexity, packet overhead, and fast peer 
selection algorithm for the dynamic scenario. The results of this paper open interesting 
problems in applying rateless codes for P2P applications. 

The optimal download time region (set of optimal download times) for one-to-many file 
transfer in a P2P network can be characterized by a system of linear inequalities. Hence, 
minimizing the WSDT for all sets of peer weights leads to the download time region. The 
set of peer weights can also be assigned according to the applications. For instances, for a 
file transfer application with multiple classes of users, assign a weight to each class of users. 
For an application with both receivers and helpers, assign weight zero to helpers and positive 
weights to receivers. Hence, the results of this paper in fact apply directly to one-to-many 
file transfer applications both with and without helpers. 
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